TEI exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Old spellings, new methods: automated procedures for indeterminate linguistic data

Internal identifier: 000058 (Main/Exploration); previous: 000057; next: 000059

Authors: Hugh Craig [Australia]; R. Whipp [Australia]

Source: Literary and Linguistic Computing, vol. 25, no. 1 (2010-04), pp. 37-52

RBID : ISTEX:5B54964A7C7BF5666057D6D7AC052AAF4B75F34F

Abstract

The authors have worked over several years on a software tool to make word counts from an archive of old-spelling early modern English plays and poems. In this article we present the outcome, a computational model for dealing automatically with variant spelling, implemented in an application which we call an Intelligent Archive. We also reflect on the perspective on Early Modern English, and on the probabilistic aspect of language in general, gained from working through the practical problems which arose in establishing the model.

Url:
DOI: 10.1093/llc/fqp033


Affiliations:



The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Old spellings, new methods: automated procedures for indeterminate linguistic data</title>
<author>
<name sortKey="Craig, Hugh" sort="Craig, Hugh" uniqKey="Craig H" first="Hugh" last="Craig">Hugh Craig</name>
</author>
<author>
<name sortKey="Whipp, R" sort="Whipp, R" uniqKey="Whipp R" first="R." last="Whipp">R. Whipp</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:5B54964A7C7BF5666057D6D7AC052AAF4B75F34F</idno>
<date when="2010" year="2010">2010</date>
<idno type="doi">10.1093/llc/fqp033</idno>
<idno type="url">https://api.istex.fr/document/5B54964A7C7BF5666057D6D7AC052AAF4B75F34F/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000508</idno>
<idno type="wicri:Area/Istex/Curation">000508</idno>
<idno type="wicri:Area/Istex/Checkpoint">000029</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000029</idno>
<idno type="wicri:doubleKey">0268-1145:2010:Craig H:old:spellings:new</idno>
<idno type="wicri:Area/Main/Merge">000058</idno>
<idno type="wicri:Area/Main/Curation">000058</idno>
<idno type="wicri:Area/Main/Exploration">000058</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Old spellings, new methods: automated procedures for indeterminate linguistic data</title>
<author>
<name sortKey="Craig, Hugh" sort="Craig, Hugh" uniqKey="Craig H" first="Hugh" last="Craig">Hugh Craig</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Australie</country>
<wicri:regionArea>Centre for Literary and Linguistic Computing, School of Humanities and Social Science, The University of Newcastle</wicri:regionArea>
<wicri:noRegion>The University of Newcastle</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Australie</country>
</affiliation>
</author>
<author>
<name sortKey="Whipp, R" sort="Whipp, R" uniqKey="Whipp R" first="R." last="Whipp">R. Whipp</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Australie</country>
<wicri:regionArea>Centre for Literary and Linguistic Computing, School of Humanities and Social Science, The University of Newcastle</wicri:regionArea>
<wicri:noRegion>The University of Newcastle</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2010-04">2010-04</date>
<biblScope unit="volume">25</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="37">37</biblScope>
<biblScope unit="page" to="52">52</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">5B54964A7C7BF5666057D6D7AC052AAF4B75F34F</idno>
<idno type="DOI">10.1093/llc/fqp033</idno>
<idno type="ArticleID">fqp033</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract">The authors have worked over several years on a software tool to make word counts from an archive of old-spelling early modern English plays and poems. In this article we present the outcome, a computational model for dealing automatically with variant spelling, implemented in an application which we call an Intelligent Archive. We also reflect on the perspective on Early Modern English, and on the probabilistic aspect of language in general, gained from working through the practical problems which arose in establishing the model.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Australie</li>
</country>
</list>
<tree>
<country name="Australie">
<noRegion>
<name sortKey="Craig, Hugh" sort="Craig, Hugh" uniqKey="Craig H" first="Hugh" last="Craig">Hugh Craig</name>
</noRegion>
<name sortKey="Craig, Hugh" sort="Craig, Hugh" uniqKey="Craig H" first="Hugh" last="Craig">Hugh Craig</name>
<name sortKey="Whipp, R" sort="Whipp, R" uniqKey="Whipp R" first="R." last="Whipp">R. Whipp</name>
</country>
</tree>
</affiliations>
</record>
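
The record above can also be read programmatically. What follows is a minimal Python sketch, not part of Dilib, that pulls the title, authors, journal and DOI out of such a record. It assumes the record is available as a string. Because the fragment uses the `wicri:` prefix without declaring it, the sketch binds that prefix to a placeholder URI (`urn:example:wicri`, an assumption and not an official namespace) before parsing. The names `parse_record`, `summarize` and `SAMPLE` are hypothetical, and `SAMPLE` is a trimmed copy of the record used only for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Placeholder namespace URI (assumption): the record never declares "wicri:",
# so a strict XML parser needs some binding for the prefix.
WICRI_NS = "urn:example:wicri"

# Trimmed copy of the record above, for illustration only.
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader><fileDesc><sourceDesc>
<biblStruct>
<analytic>
<title level="a">Old spellings, new methods: automated procedures for indeterminate linguistic data</title>
<author><name sortKey="Craig, Hugh">Hugh Craig</name></author>
<author><name sortKey="Whipp, R">R. Whipp</name></author>
</analytic>
<series>
<title level="j">Literary and Linguistic Computing</title>
</series>
<idno type="DOI">10.1093/llc/fqp033</idno>
</biblStruct>
</sourceDesc></fileDesc></teiHeader>
</TEI>
</record>"""


def parse_record(record_xml: str) -> ET.Element:
    """Parse a <record> fragment after declaring the wicri: prefix on its root."""
    patched = re.sub(
        r"<record\b",
        f'<record xmlns:wicri="{WICRI_NS}"',
        record_xml,
        count=1,
    )
    return ET.fromstring(patched)


def summarize(record: ET.Element) -> dict:
    """Collect a few bibliographic fields from the first <biblStruct>."""
    bibl = record.find(".//biblStruct")
    return {
        "title": bibl.findtext("analytic/title"),
        "authors": [name.text for name in bibl.findall("analytic/author/name")],
        "journal": bibl.findtext("series/title"),
        "doi": bibl.findtext("idno[@type='DOI']"),
    }


if __name__ == "__main__":
    for field, value in summarize(parse_record(SAMPLE)).items():
        print(f"{field}: {value}")
```

Patching the root element leaves the rest of the record untouched, so the same functions should work on the full record as long as its root is still `<record>`.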

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000058 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000058 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:5B54964A7C7BF5666057D6D7AC052AAF4B75F34F
   |texte=   Old spellings, new methods: automated procedures for indeterminate linguistic data
}}

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024